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' To calculate the entropy of a subalgebra or of a channel with respect to 

a state, one has to solve an intriguing optimalization problem. The latter is 
also the key part in the entanglement of formation concept, in which case the 
subalgebra is a subfactor. 
^ ■ I consider some general properties, valid for these definitions in finite dimen- 

sions, and apply them to a maximal commutative subalgebra of a full matrix 
qh| algebra. The main method is an interplay between convexity and symmetry. A 

collection of helpful tools from convex analysis is collected in an appendix. 

•i-H - 

INTRODUCTION 

This paper considers the entropy of a subalgebra or of a completely positive map with 
respect to a state, an entropy-like quantity introduced by A. Connes, H. Narnhofer, 
and W. Thirring. I remain, however, within a rather narrow setting: A pair of 
algebras, *-isomorphic to the algebra of all d x (i-matrices, and to its subalgebra 
of diagonal matrices. I depart from this restriction within this introduction and in 
discussing some tools from convex analysis (lemmata 1, 2, 3, and appendix). 

While the von Neumann entropy is of undoubted relevance for type I algebras (with 
discrete center), the relative entropy can be meaningfully defined even on the state 

"'^ Dedicated to Walter Thirring at his 70th birthday 
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space of an arbitrary *-algebra. There are, depending on the category of algebras and 
states, several quite different ways to do so, 0. 

In 1^ Narnhofer and Thirring proposed a von Neumann entropy definition by the 
aid of relative entropy. At the end of their paper they mentioned a quantity now 
denoted by Hi^{A) or Hi^{B\A) where ^ is a unital subalgebra of B, and u a state of 
B. Abbreviating the restriction of u onto Ahj u, 

uj u := a;|^, 

their definition reads 

H^{B\A) = H^{A) := sup PjS{Coj, Co), AcB {*) 

In this expression S{., .) is the relative entropy for the states of A and the supremum 
has to run through all convex decompositions 

^ = E^i^i 

of the state u on B. {*) was later called "entropy of a subalgebra with respect to a 
state". 

It depends concavely on the state, is always non-negative, and it inherits from relative 
entropy its monotonicity: 

A1CA2CB ^ H^iAi) < H^{A2) < S{lo) 

According to |^ S{uj) = is the von Neumann entropy of uj. 

{*) amounts to calculate a number. Seeing the ease and elegance of the definition one 
might perhaps not believe what a formidable task this is. The calculational difficulties 
are mainly encoded in R, a functional defined by 

H4B\A) + R{B\A, u) = H^AIA) = S{Co) 

where is a state of B. 

All terms are non-negative. If S{uj) < 00 they are finite. R is the convex hull of the 
function u S{u}), as explained below. In the finite dimensional case one may write 

R{B\A,u;) = miY,p,S{g,), 

the infimum being taken over all extremal convex combinations 

u = 'Ypjgj, ^fc pure 

Therefore, R is already determined by its values at pure states. 



2 



In the present paper ^ is a maximal commutative subalgebra. The case of a general 



subalgebra has been considered by Benatti, Narnhofer, and Uhlmann, [|T0|. But their 
main examples concern maximal commutative subalgebras. The same is with the 
paper of Benatti, |Tl|], who shows a relation of (*) to accessible entropy. An example 



with another subalgebra is in [|T^ 



Results and completely independent definitions for the subfactor problem are due to 



Bennett, DiVincenzo, Smolin, and Wootters, [0, who aimed at the i?-function on 
the states of a direct product of two finite dimensional factors. They defined the 
entanglement of formation by 

A a subfactor — > E{uj) = R{uj) = entanglement of u with respect to A 

in order to measure entanglement as a resource for quantum information transfer. 
They obtained estimates and solved interesting examples. Because the reductions of 
a pure state to a factor and to its commutant have the same entropy, there is a nice 
symmetry 

R{Bi^B2\BuUj) = R{Bi(g)B2\B2,iu) 

Hill and Wootters, [|l^], solved the problem for rank two states on the direct product 
of two 2-dimensional matrix algebras. 

The definition (*) can be extended to a completely positive unital map, a, from one 
algebra to another one, 

a : A^B 
Its transpose, a stochastic mapping, 

oj ^ oj o a, {oj o a){A) = u!{a{A)), A ^ A 

maps states of B to those of A. 

To get the definition one has only to set u := u o a within (*) to obtain the desired 
quantity H^{a). This is an invention of Connes, Narnhofer, and Thirring in 0. 

Ohio and Petz called a a channel map acting from the output algebra A to the input 
algebra B, so that its transpose, a stochastic mapping, acts from the states of the 
input algebra into the state space of the output one. In their monograph p|, in 
which they consider the problem within the C*- and the W*-category, H^{a) is called 
entropy of the channel a with respect to the state u. Now the monotonicity property 
reads 

H^{ai o < H^ioii) < S{uj) 

whenever aioa2 is well defined, unital, and completely positive. As known, the latter 
requirement can be weakened to Schwarz positivity 

a{A*A)>\a{A)\^ ^AeA 



^Meanwhile, B.W. Wootters has solved the 2-qubit case completely, quant-ph/9709029 v2 
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GENERAL PROPERTIES 



Let 7i be a Hilbert space of finite dimension diniTi = d, and C a maximal commuting 
subalgebra of B := B(Ti.). Let us denote by Pj = j = 1,2, ... ,d, the minimal 

projection operators of C. They support the distinguished pure states i. e. 

P,BP, = gf{B)P„ \/BeB (1) 

The density matrix of a state of B is contained in C iff the latter is a convex com- 
bination of the pure states The restriction of a state u onto C can hence be 
described by the reduction map 

u-.^:=Y.uj{P^)q^ (2) 

Now we consider entropies. All what is needed is nicely reviewed in . The entropy 
of the restriction uj oi uj onto C reads 

~S[u) := S{u) = -j:^iPj) H^iPj)) (3) 

It is now possible to write down the entropy of C with respect of a state u of B as 
defined by Narnhofer and Thirring Connes and [0. In the case at hand the 
general definition is equivalent to 

H^iC) := S{lu) - R{uj), R{uj) = inf (4) 
where the infimum runs through all convex decompositions 

in the state space Q of B. Rockafellar calls the construction used in defining R the 
convex hull of S. The convex hull of any function on any convex set is always convex. 
Thus is the sum of two concave functions, 5* and —R, and hence concave. 

For the following it is essential that R is the convex hull of a concave function, and 
that fl as well as fl*^^, the set of its extremal points, are compact. Being the state 
space of B, a state is extremal iff it is pure. A state g is pure iff there is a projection 
operator P & B, the support of g, such that PBP = g{B) P for all B in that algebra. 
Notice: lemmata 1, 2, and 3 below are valid for every unital subalgebra, the restriction 
to a maximal commutative subalgebra is not essential for their validity. 

The first conclusion is due to the concavity of S: It is possible to restrict (H) to 
extremal convex decompositions. 
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R{uj) = miJ^PjSiQj), Qk e n""", ^PjQj = (6) 
Let us call optimal every extremal convex decomposition of uj with which the infimum 



is attained, and for which > for all its coefficients |]T0|- Thus optimality is 
expressed by 

R{u)=Y.P,S{qj), QkeQ'^ \fpk>0 (7) 

The graph of i? is a closed subset of the boundary of a compact convex set, W^, see 
appendix. It implies, by standard arguments, that there are optimal decompositions 
for every state. Then, according to Caratheodory, there exist simplicial ones. This is 
the content of 

Lemma 1 

Every u admits an optimal decomposition with at least rank(a;) and at most rank(a;)^ 
different pure states. □ 

I need some further, almost obvious conclusions from the definition of R. For the 
time being a convex subset Qq of Q will be called an R-set if every uj E Qq admits an 
optimal decomposition into pure states of Qo- It is clear that 

a) every i?-set Qq of the state space is the convex hull of its pure states. (Therefore, 
the pure states contained in are just the extremal elements of Qq.) 

b) that R, restricted to Qq, can be computed by optimal decompositions into pure 
states which are all contained in Qq, 

c) and that every face of fl is an i?-set. 
Lemma 2 

Let Qo be an i?-set and fl^^ the set of its pure states. Let F be a convex function on 
Qo which is not greater than R on Q^. Then F < R on Qq. □ 

With other words, on any R-set Qq of the state space, R is the largest convex function 
which attains at every of its pure states, g, the value S{g). Indeed, for an optimal 
decomposition, based on Qq, convexity of F implies 



Riuj) =J2p,Sig,) >J2p,Fig,) > F^ 



UJ 



My next task is to apply this simple lemma to affine functions in order to obtain a 
slight modification of theorem 1 of [|l^]: Q can be covered by convex sets on which R is 
affine. This fact I like to call the roof property of R. The covering consists of "facets" 
with pure states as corners. The covering is not a disjunct one. The intersection of 
two "facets" is either empty or belongs itself to the covering. 
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To obtain the covering, we use the well known possibility to represent a convex 
function on a compact convex domain by an upper bound of affine functions, together 
with the existence of optimal decompositions. 

An afiine function, I, u l{uj) is said to support R iS I < R on Q, and / equals R at 
least at one state. By virtue of lemma 2 one needs to check the inequality I < R for 
pure states only. 

Q is compact and R convex. Hence there exists for every u' & Q at least one affine 
function / supporting R at u', i. e. with /(cu') = R{u'). Thus 

R{u) = sup/(co'), / supports R 
I 

Now, if / is supporting R, let us consider the set 

■={uj' eQ \ R{uj') = l{uj') } (8) 

on which / coincides with R, and let us assume, u belongs to that set. Choosing an 
optimal decomposition one obtains 

Y^Pj^i^j) = ^(^) = ^^(^) = Y^pMoj) 

But < R{0j) as / is /^-supporting. The positivity of the coefficients pj enforces 

equal values of R and / for all involved pure states. This is not the end: I is affine 
and equal to R on some extremal elements Qj. Therefore, by convexity, R < I on 
the convex hull of the pure states gj. But / < -R by assumption. Hence I is equal 
to R on the convex hull of all the pure states g which can appear in any optimal 
decomposition of u. This is already the essence of 

Lemma 3 

Let be defined by (^) with an affine function / supporting R. Then is a 
compact, convex i?-set on which R is affine. 

The family of all where / is /^-supporting, is a covering of Q. □ 
Proof: 

Up to the compactness assertion the proof is already done by the chain of arguments 
above, which can be repeated with every u G In particular, is an i?-set. 

Now R equals S on the compact set fl"^^. Hence both, R and l^^ are continuous on this 
compact set. Hence, the subset of Q'^^, on which both functions take equal values, 
is compact. This compact set of extremal points generates a compact convex set 
(Caratheodory) which must be as it is an _R-set. □ 

Remark: 

B being finite-dimensional, the Hermitian linear functionals span a real linear space. 
An affine function on it is of the form = I'iAi) + a with a real constant ai and 
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with an Hermitian operator Ai. For every state, ui, the constant a can be represented 
by a-times the evaluation of u at the identity. Hence, for every affine / supporting 
R, there is exactly one Hermitian operator A such that the expectation value ui{A) 
equals /(cu), 

l{u) = uj{Ai), Ai = A*eB (9) 

With an Hermitian operator A, satisfying S{q) > g{A) for all pure states g, the 
expectation functional A — > uj{A) satisfies -R(ci;) > uj{A) on the whole state space by 
virtue of lemma 2. If, in addition, equality takes place on at least one pure state, then 
the expectation value of A is supporting R. I shall consider this aspect elsewhere. □ 

There may be many linear functionals I supporting i? at a given state u. For every 
pure state g, appearing in any extremal optimal decomposition of u, we get l{g) = 
R{g) and g & Qi. Hence 

Corollary 

Let cij be a state. The intersection 

:= fl l{uj) = Riuj) (10) 

enjoys the following properties: It is convex, compact, and it contains every pure 
state which can appear in an optimal decomposition of u. R, restricted to fl^, is 
affine. □ 

fli^ is a simplex iff u allows for one and only one extremal optimal decomposition (up 
to the order of its summands). 

Remark: In I have called $aj the convex set generated by those pure states in fi^^ 
which can appear in an extremal optimal decomposition of uj. Clearly, is contained 
in Qi^. Presently the believe, both sets are equal, remains a conjecture. f\ □ 

Lemma 4 

Let H^^ = 0. Then uj is pure. □ 
Proof 

Let us consider an arbitrary convex decomposition (^. Then, by definition of R and 
by concavity of S 

R{uj) < Y.PjS{Cdj) < S{Cu) 

The assumption of the lemma implies equality. But S is strictly concave. Hence uj 
must be equal to ojj for all j. Because every state of the face of u can occur in a 
convex decomposition of uj, the whole face is reduced to a single state on C by (@). For 
a maximal commuting subalgebra such a face cannot contain more than one state. 

■^Thanks to the referee, who pointed at the gap. 
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Remark that the inverse statement is evident: If q is pure then H[q) = 0. □ 

In the following u ^ lo denotes a complex conjugation such that Pj{i^ — uj)Pj = for 
all j. In a suitable base for the density operators the complex conjugation changes 
the off-diagonal entries to its complex conjugates but does not change the diagonal. 
li cu — cu, the state is called real. 

Lemma 5 

If g and g both appear in an optimal extremal decomposition then g — g. 

Let U e C be a unitary. If g and its transform g^ both appear in a proper optimal 
decomposition, then they are equal. □ 

Corollary 

The set of real states is an R-sei. Every pure state occurring in an optimal decom- 
position of a real state is real. □ 

Proof. Let ^ be a pure state and r = {g + g)/2. Then g, g, and r have the same 
reduction to C and the same S*- value. Assume the two extremal elements would 
appear in an optimal decomposition. Then R is afiine on their convex hull, (lemma 
3). Hence R{t) = S{t). By lemma 4 r has to be pure implying g = g. The same 
chain of arguments is vahd in the other case of the lemma. □ 



USING SYMMETRIES 

If only U*C U = C is required, things are not covered by lemma 5. These unitaries 
form the normalizer of C in B. They permute the minimal projection operators Pj 
of C. Let U a unitary from the normalizer. Then there is a permutation i — > 
with UPiU* — Pj{i). This way we obtain the well known homomorphism from the 
normalizer onto the permutation group of d = dimTY elements. Let us call U a 
transposition iff it interchanges two minimal projections while the other ones remain 
unchanged. 

My aim is to consider optimal decompositions of states which are generated by sym- 
metries. If [/ is a unitary, uj^ is defined by u'^{A) = u{U AU*) for all A in the algebra. 
The computations are conveniently done by the help of density operators. Using the 
trace of B, the latter is defined by 

u;{A)^TtDA, D^D^, VA (11) 

If u; is transformed to u;^, the density operator becomes U*DU. 

The rank of a state is by definition equal to the rank of the smallest projection 
operator, say Q, satisfying u{Q) — uj{l). Q is called support of u; and of the operator 
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D = D^. We mention the equality of the rank of u with the dimension of the 
supporting subspace QTi. We shall need further 

■= Q n, := QBQ = Bin^) ^ Mk (12) 

where k = ranker. Q is the unit element of If A & B^^ is of rank k, then A is 
invertible in that algebra, i.e. there is a unique B & B^ with AB = Q. In particular, 
every positive power D"^ is invertible in Below this will be used with A; = 2, in 
which case things can be controlled explicitly by the help of Pauli operators. Before 
going to that issue, let us rewrite (j^) with the projection operators, Pj, of our maximal 
commutative subalgebra C 

^M = 5:.(TrP,D) 
Consider now a rank two state uj with density operator D and support Q. Then 

]^q = {i-i:vdWd-d^), (13) 

Lemma 6 

Let be a state of rank two and t/ be a transposition such that uj^ = cu, so that 
its density operator D = commutes with U. Then the following properties are 
equivalent: 

(a) There is no other [/-invariant state in than uj. 

(b) uj allows for an optimal decomposition of u of length two, and at least one element 
of does not commute with U. 

(c) consists of two elements which are interchanged by U. □ 
Proof. To be definite we choose a transposition U fulfilling 

UPi = P2U, UPj = PjU, Vj > 2 (14) 

There is a 180°-rotation in B^j through the action of U. (If not, all elements of that 
algebra had to be [/-invariant, contradicting every of the three properties, a, b, c.) 
We choose matrices, aj, in this algebra satisfying the algebraic properties of the Pauli 
matrices, with defining the rotational axis of U. We are allowed to require 

as := U Q, ajU + Uaj = 0, j = 1, 2 (15) 

because U commutes with U by virtue of (pr3|), and, being a transposition, U = U*. 
Therefore, of (p!5[) is Hermitian and its square equals Q. But Q 7^ UQ because U 
induces a non-trivial rotation of the supporting subspace. As we now can see, every 
[/-invariant operator in B^j is a linear combination of Q and (T3. In particular. 
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D = ]^{Q + x^a:,), ^(1 + x^) = Tr Z^^ 



(16) 



With any pure q also is contained in Vl^. Assuming property (a) of lemma 6 we 
obtain the optimal decomposition 

\{Q + d')=u^ (17) 

Thus (a) (b). 

The density operator of any pure q satisfying (|1^ must be of the form 

De = \{Q + 11^3^3). x\ + xl = l-xl = 2{l-TiD^) (18) 

For j > 2 the projections Pj commute with U. But Pjak, k = 1,2, change sign if 
transformed with U. Their traces must be zero. This implies 

TrPjD = TiPjD^, j>2 (19) 
^ Tr (Pi + P2)D, = Tr PiD = Tr P2D (20) 

In order that (c) follows from (b) there should be only two choices for g if optimality 
is required. By ([19| ) and (|20|) the remaining possibility to get S{g) as small as possible 
is in making the modulus of the difference Tr(Pi — P2)Dg as large as possible. To do 
this is the next aim. 

If the above mentioned difference is zero, then TiPjDg = TiPjD for all j. But then 
the entropy of g and u would be equal, and, consequently, H^^ = 0. By lemma 4 this 
contradicts the rank two assumption for u. Therefore, the first of the operators 

D,{P^-P2)D„ Q{Pi-P2)Q, Vd{P^-P2)Vd 

is not zero. (Remark Dg = ^Jd^, for g is pure.) But the first operator results 
from multiplying the second one from the right and the left by D^. Thus the second 
operator is again not the zero of And, lastly, the root of D is invertible in B^^. This 
means the third operator is not the zero. All the operators of our list are Hermitian. 
All change sign if transformed with U. Hence they are real linear combinations of 
(Ji and (J2. We use the remaining freedom in the choice of the Pauli operators by 
requiring 

Q (Pi - P2) Q = y cji, y>0 (21) 
Multiplication with and taking the trace gives 
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Tr (Pi - P2)D, = xiy (22) 



according to (p^). The left hand side is maximal (minimal) iff \xi\ is as large as 
possible. This takes place if and only if X2 = 0, see (plsl) . 

That proves (b) —>■ (c). 

Indeed, with X2 = 0, just one choice for xi > is allowed, 

1 . „ 1 



^7 = {9,9"^}, D, = -iQ + xiai + x^as) = D + -^l- xla^ (23) 

Evidently, (a) follows from (c), and the proof is done. □ 

Let us register the following observation: 

Corollary 

Let uo satisfy the conditions of Lemma 6. The choice (|T5|) , ( ^1]) of the Pauli operators 
(Tj depends only on the support Q oi u. □ 

To see it, remind that Pj are rank one projections and that the traces of PiQ, P2Q 
are equal. We get y by squaring (PT|) and taking the trace. Reinserting in (BTl) yields 



Tr(Pig) Tr(P2g) - Tr(PiQP2Q) ) = Q{P, - P2)Q (24) 



Similarly we see from (p!8|) and by sandwiching (ETF) with v D 



D{Pi-P2)^fD = ]^xiyai (25) 
Squaring and taking the trace comes down to 

^ xly^ = Tr(PiD) Ti{P2D) - Tr(PiZ}P2D) (26) 

To be able to calculate S{g) = R{lj) we need the traces of D^Pj, j = 1,2. This can 
be done by combing (|20|) and (p2|): 

Tr P,D, = TiiPiD) + {x,y)/2, Tr P2D, = Tr(PiD) - (xiy)/2 
Now, because of (p6D, explicit expressions for R and are 



R{u) = s(Tr(PiD) + ^xiy) + s(Tr(PiD) - ^xiy) + s{TiP,D) (27) 

^ ^ i>2 
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= 2s(Tr(PiD) - s(Tr(PiD) + h^y) - siTi{P^D) - ]^xiy) (28) 

provided the assumption of lemma 6 are satisfied. The trace of PjD is the expectation 
value of D with the vector Similarly, ( PBD may be written 

\xly' = {l\D\l){2\D\2)-{l\D\2){2\D\l) 

Looking at all this, the difficulties in extending lemma 6 to symmetric density oper- 
ators of higher rank are as follows: ([13| ) becomes an equation of degree rank(a;) , and 
the number of parameters goes quadratically with the rank. 

There is a remarkable outcome of lemma 6. With an arbitrary pure state q and a 
given transposition U there is a twofold alternative. At first, either ^ = or the 
arithmetic mean (|17D of q and is of rank two. In the latter case Vt^ is [/-invariant. 
Hence, either the conditions of lemma 6 are satisfied or they are not. In the latter 
case, Q is not optimal. However, because of the symmetry, there is necessarily at least 
one optimal pure qi in VL^^ such that lemma 6 applies to the arithmetic mean of qi 
and q\. Hence consists of one or more pairs of pure states, which are pairwise 
permuted by f/, and, possibly, of some [/-invariant pure states. 

Corollary 

If uj is [/-invariant but VL^ is not elementwise [/-invariant then Vt^^ contains at least 
one state to which lemma 6 applies. 



SYMMETRIC REAL DENSITY OPERATORS 

Let us compare the treatment above with that of some highly symmetric density 



operators of maximal rank according to |]T0[ . Assuming uj real., every optimal decom- 



position of uj is real (lemma 5). Even more essential, D = D^, the density operator 
of u, is supposed to commute with all permutation matrices, U, fulfilling UCU* = C. 
In the following the latter assumption is always requiered. 

To every permutation, vr, there is a unique permutation matrix, [/^r, in the normalizer 
of C. These matrices are real unitaries with entries or 1, and in every row and 
every column there is just one 1. If a real density operator is commuting with all 
real permutation matrices, only one free parameter remains: It is the common value, 
z, of the off-diagonal elements. The diagonal elements equal 1/d, d the dimension of 
our Hilbert space. The common off-diagonal value is bounded from above hj 1/d and 
from below by —l/d{d — 1). 

Now let a pure state g with density operator appear in an optimal decomposition 
of a real permutation invariant state. Then every transform of ghj a permutation 



12 



matrix is contained in Therefore, lemma 3 shows optimahty of the decomposition 
(which is not necessaily short) 

^ = ^ E U.D, U: (29) 
One of the relations following from (^) reads 

TrD^D = TrD^ = ^ + d{d-l) (30) 

We may write 



D,= W){V\ (31) 

with a real unit vector Lp. Denoting by 0i, . . . , 0^ the components of in a base that 
diagonalizes the minimal projections Pj of C, the relation ( |30[ ) implies 



^0fc = a, a = ^1 + zd{d-l)>Q (32) 

where the sign of the real a is fixed by a > . This seemingly harmless convention has 
an important effect. Being real, is defined by (0) up to a sign. If a 7^ 0, this sign 
has been fixed by (0). (|32|) is an affine hyperplane, intersecting the {d — l)-sphere 
spanned by the real unit vectors ip. As long a > the map — is a section 
from the real pure states into the Hilbert space. For a = we get a double covering 
because with Lp also —if belongs to the sphere. That is, in the limit a — > the simple 
covering bifurcates to a double covering. 

The point for all this comes from lemma 3, showing that ( pUj ) implies R{uj) = S{g) 
because R is affine on the convex set generated by an optimal set of pure states. 

Thus we have to minimize >S'(^) on the intersection of the {d — l)-sphere of real unit 
vectors with the hyperplane (0), i.e. on a (rf — 2)-sphere Its radius r in Hilbert 

space turns out to be 



r = r(Sr) = f - ^ = ^(l^illL^ (33) 

From z = 1/d, where it degenerates to a point, the radius grows up to one with 
growing z. At the same time a goes from ^/d to zero. 

Let G S^""^ and denote by ip^ its antipode on that sphere. Then their Hilbert 
distance is twice the radius (Bl), which amounts to 
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((^,<^^) = 2^-l = l-2r2 (34) 

so that the transition amphtude remains positive as long as the radius does not 
exceed tq := \/(lE. Thus, for < r < tq, the Bures distance of the states and 
\ip^){ip-^\ is equal to the Hilbert distance of ip to its antipode ip-^. But for tq < r < 1 
the transition amplitude becomes negative and the mentioned Bures distance gets the 
value 2vl^^r^. This can be rephrased as following: Within < r < 1 the sphere 
iS^^^ is one to one mapped into the state space. This mapping is locally isometric. 
The local isometry is a global one for < r < tq. But it becomes globally deformed 
if r is larger than tq in order to "prepare" the bifurcation at r = 1. Because of the 
described scenario something should happen with the optimization and its outcome 
R. What it is, is definitely known in case d = 3, and will be described below. 

For the next considerations I assume d = 3. With d — 2 = 1 the optimalization 
takes place on an 1-sphere. There are three permutation matrices which are trans- 
positions. They are denoted by U12, U23, and f/31. In particular, the real unitary U12 
interchanges the components 0i and 02 of ip, while 03 remains unchanged, and so 
forth. The product of any two of the three transpositions is a cyclic permutation of 
the components of ip. 



Now I return to an important result of |jTO[ which clarifies the structure of fi^^ in its 
dependence on z. 

There is a special z-value, the bifurcation value z^,, which is —0.14 approximately. For 
values —l/6<z<z^ the convex set fl"^ is an hexagon. But for < z < 1/3 it is a 
triangle, i.e. a simplex. 

In the triangle case there is, up to reordering of its extremal states, exactly one short 
optimal decomposition of u. It is of length three and explicitly known DTof. The 



following representation of their density matrices Dj is in |TT|]. It is 



^ = + + Dj := 3 y/ujPjy/uJ (35) 

This representation is equivalent to (PPj): The 6 = 3! terms in ( P^ become pairwise 
equal. Every transposition permutes two of the three pure states in allowing for 
an application of lemma 6, but let the third one unaffected. On the other hand, a 
cyclic permutation matrix induces a cyclic reordering of Q'^, i. e. of three pure states 
of (H). 

More involved is the hexagon case. After the bifurcation value every of the three 
optimal pure states of the simplicial decomposition splits into two other ones. That 
is to say, from every one of the three pure states qj, j = 1,2,3 of the triangular 
optimal decomposition originates two new ones, Qja and Qjb- A transposition, say 
U12, previously interchanging 1^2 but letting 3 fixed, now does a more complicated 
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job: la 2b, lb ^ 2a, and 3a ^ 36. The pair g^a, Q^b, together with U12 allows for 
the application of lemma 6. 

The states labelled by a are interchanged by a cyclic permutations, and the same 
is with the 6-states Qi^j. From that one obtains two essentially different simplicial 
decompositions, 

^ = ^{Qla + Q2a + Q3a) = ^(^'ife + Q2b + QSb) (36) 

However, no pair with the same index "a" satisfies the assumptions of lemma 6. The 
same is with respect to the index "6" : Only states on the line segment containing two 
neighbored extremal points can have a unique extremal decomposition. 

We already know: Something appears if z goes down to —1/6. The Study-Fubini 
distance of the pairs of pure states labelled by (la, 26), (2a, 36), or (3a, 16) respectively, 
is diminishing. The distance finally becomes zero for z = —1/6, resulting in Qia = g2b, 
and so on. The hexagon bifurcates and becomes again a triangle in state space. 

In the Hilbert space one gets an equilateral hexagon at 2 = —1/6. As explained 
above, it becomes our triangle in state space by identifying the vectors and —(p 
(Hopf bifurcation). From this point of view it really looks as if we had to compensate 
that Hopf bifurcation by the bifurcation of the optimal decomposition rule at z^. 
If this impression is correct, the appearance of z^, is necessary by general geometric 
reasons. Only its value should come from the particular properties of the function 
s{x). 

Let us compare this reasoning again with lemma 6. Let ^ be a real pure state, U a 
real transposition that does not commute with g, and denote by u' their arithmetical 
mean, u' = {g + g^)/2. If the latter is not an optimal decomposition, then is 
spanned by more than two extremal states. The assumption, that we then fall into 
the triangle or hexagon case, is compatible with the symmetry and geometrically 
tempting. This conjecture reads: 

Let g a real pure state, U he a. real transposition, and g 7^ g^ . If the assumptions 
of lemma 6 do not apply to the state {g + g^)/2, then g belongs to an optimal 
decomposition of a real and maximally symmetric state. □ 



The conjecture is supported by numerical studies and the results of ||Tn|. To get a 
complete proof, one has to exclude further bifurcations. I do not know how to achieve 
this. □ 

Remark: For d = 2 every pair g, of pure states, V a transposition of the minimal 
projections of C is optimal. Indeed, this remains true if — xlnx is replaced by an 
arbitrary concave s[x) with s(0) = s(l) = 0. □ 

For (i > 3 a similar analysis is preliminary only. To obtain a pure state |v?)(v^| belong- 
ing to an optimal decomposition (E9|) it suffices to restrict oneself to the following 
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assumption: The components of (p do not attain more than three different values. 
This can be shown by straightforward variational analysis. 

Moreover, if the components of (f) attain only two different values, g is either a local 
minimum, a maximum, or a turning point of R. Nearby z = 1/d the vector ip with 
components 

^ 0, > 0, 01 > 02 = 03 = • • • = 0d (37) 

gives at least a local minimum of S{uj) which is presumably a global one. The f/^- 
transforms of Dg = \(p){ip\, where ip satisfies (|37D , generate a simplex spanned by d 
extremal states. For z- values satisfying (|32D and (|37D, and such that (^) becomes 
optimal (though not short), the simplex decomposition will be 

a; = - ^ gj, Dj := d Pj ^Ju) (38) 

This is supposed to be the counterpart, for (i > 3, of the d = "i triangle case. Of 
course, much more has to be known to clarify the c? > 3 situation even in the real 
and maximally symmetric case. 

What is to learn about the role of symmetries from all that? Given a state tu, one is 
tempted to look at the subgroup 

r(cj) := {f/| f/*Cf/ = C, f/*r]<^f/ = fi^} (39) 

of the normalizer of C As seen in the previous examples, a certain classification can 
be reached by examining to the detail the action of r(ci;) on the pure states contained 
in Is it always true, as in the examples considered above, that there is exactly 
one ijj' e which is invariant with respect to T{u) ? And, if yes, is this group large 
enough to get an optimal decomposition of a;', starting with any of its optimal pure 
states? 



Appendix: Roofs 

A function enjoys some very nice properties if defined according the rule of (P). Some 
of them have been used by Benatti, Narnhofer, and Uhlmann |T^, by Uhlmann | ]14| | , 
by Hill and Wootters , and others to examine either the entropy of a channel or of 
a subalgebra with respect to a state, 0, or the entanglement of formation, [T^. They 
can also provide computational help to Holevo's channel capacity 0. In addition 
there are connections to the optimalization problem of accessible entropy shown by 
Benatti They explain certain similarities to results of Davies [^, Levitin P, 

Fuchs and Peres . 
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In the following I treat these general properties within an abstract setting. Its first 
requirement is as follows: 

ASSUMPTION 1: Q is a compact convex set in a finite dimensional real space C. 

Remark: In most physical apphcations fl is the state space or the space of density 
operators of an algebra *-isomorph to a full matrix algebra. H denotes an 

Hilbert-space of finite dimension d. Fixing Q to be the convex set of all density 
operators, £ is the real linear space of Hermitian operators, Herm(7Y), of Ti. Then 
C is of dimension d"^. The dimension of $7 is rf^ — 1. Only for reference within the 
appendix I call this the standard setting. It is convenient to require 

n:=dim£= l + dimfi, ^ fl (40) 

This provides the following: Because the zero of C is not contained in Q, the linear 
span of Q coincides with C. Choose r G O arbitrarily. To every z/ G £ there is one 
and only one real number A such that v — \t G VL. For the remainder r is chosen once 
for all as a reference state. It is often convenient, though not necessary, to assume 
invariance of r against all affine automorphisms of Jl. (r is then called maximally 
symmetric.) 

ASSUMPTION 2: The set of extremal elements, Q.^^, ofD, is compact. A continuous 

function, Q ^ f{Q), is given on VL'^^. □ 

The next aim is to extend the function given on the extremal boundary of Vl to the 
whole convex set VL. Of course, there are many ways to do so. But there exists 
two distinguished among them, respecting maximally the convex structure of Q. For 
reasons which will became evident soon, I call them the the convex and the concave 
roof based on f . The convex roof, /'"^, is defined by 

r^H:= inf^p,/(^,), ^,gQ^ Y.PjQ3-^ (41) 

where the infimum runs through all extremal convex decompositions of uj. Completely 
similar, the concave roof, /^"p, is defined by 

rPM:=sup$:p,/(^,), ^feGQ^^ Y.P3Q3-^ (42) 

Because — /™p = (— /)"^^ every property of convex roofs can be translated in one for 
concave roofs, and vice versa. Evidently f^^^ > f^^^. 

The task is now to show how the graphs of /™p and unite to the boundary of a 
compact convex set S of dimension n. It will be done by a construction depending 
on the reference state r. The set 

S^":={/(^)T + ^| gen^"-} (43) 
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does not contain convex linear combinations of their elements with the exception 
of the trivial ones. Otherwise Q'^^ could not be a set of extremal points of a con- 
vex set. Continuity of / and compactness of fi*^^ imply compactness of S*^^. Hence 
(Caratheodory) 

Lemma A-1 

The convex hull S of is compact. S''^ is the set of extremal points of S. □ 
z/ G H iff 1/ allows for an extremal decompositon 

resulting in 

Xr + ueE < — > < A < /™P (44) 

The compactness of S ensures the compactness of the A-interval defined by (0). It 
follows the existence of optimal decompositions with which the "inf" in ( Pl) or the 
"sup" in (^) are attained respectively. Moreover, \t + oj belongs to the boundary 
of S iff A equals either /'"^(a;) or f^^'^{uj). The dimension of its face cannot exceed 
n — 1. Caratheodory 's theorem guaranties optimal decompositions of length n. 

Lemma A-2 

as well as f^^"^ allows for optimal decompositions of length not exceeding n. □ 

There is another construction, valid on every convex set. Let uj — * G{uo) denote 
an arbitrary function on VL. Its convex hull is defined by 

G;^f{u) ■.= i\il^PjG{ujj), ujk^^, ^PjUJj = uj (45) 

where the inf runs through all representations of uo by convex combinations, i. e. not 
necessarily extremal ones. It is known, see and easily verified, that every convex 
hull is a convex function. 

In the same spirit one my define the concave hull of a function just by replacing the 
"inf" in (|45| ) by "sup" . With this definition the concave hull of a function is concave. 

Now assume in (^5|) a concave function G. Then we may restrict ourselves to extremal 
decompositions to get the desired infimum: The convex hull of a concave function 
depends on it values at the extremal points only. Now it is straightforward to see 

Lemma A-3 

is the convex hull of /^^^p. is convex. Let F be a convex function which is 
not larger than / on Vt^^. Then F < on Vt. 

f^'^^ is the concave hull of /™^. /'^"^ is concave. Let F be a concave function which is 
not smaller than / on Q^^. Then F > /^"p on fi. □ 
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Remark A 1.2: Let us consider the standard setting where f2 is a state space and S 
the von Neumann entropy. If a; — > a; o a denotes an affine mapping of the state space 
into itself then 

uj S{uj) := S{uj o a) 

is a concave function on Q. Its convex hull, S'inf, is denoted by R in |T0[ and and 
by E for "entanglement" in and |T^. Let / denote the restriction of S onto Q'^^. 
We have 

s > > r' = ~Sn.u H^=~s- 5inf > - r' 

Because all the functions are non-negative, they are defined for general state spaces 
(say in the C*-category) if the von Neumann entropy remains finite on o a. □ 

Now a further notation is introduced. Let F be a function on VL. A set of extremal 
points of Vt is called F- optimal if and only if F is affine on its convex hull. 

I call F a roof if every element uj is contained in the convex hull of an F-optimal set. 

This is consistent with the notations above: The concave and the convex roof of a 
function /, which is defined on the extremal points, are roofs. It is the content of 
theorem 1 in |10||. As already indicated in the main text (lemmata 1-3), one can do 
a little bit more. What there is called will now be denoted by 1)™^ to distinguish 
it from Qf^P. 

JIJ^^ is the smallest convex set containing all extremal points of all extremal decom- 
positions of u which are optimal for /™^. 

is the smallest convex set containing all extremal points of all extremal decom- 
positions of UJ which are optimal for f^'^v. 

Lemma A-4 

and /'^"P are roofs. They are affine on 1]™^ and Q^^^ respectively. 
Corollary 

The convex hull of a concave function and the concave hull of a convex function are 
roofs. 

Two convex (or two concave) roofs are equal iff they coincide on the extremal points. 

□ 

The proofs are mere reformulations of those in the main text. They can be done also 
more explicitly as in \W{ . 

Remark: We may now rephrase the definition of H^^ as following: H^^ vanishes on 
and it is the sum of 5* and of a concave roof. □ 

What happens if equality holds, /'^"p(co') = /"^(cu), for a certain uj. From the very 
definition 



UJ 
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so that every extremal decomposition is optimal for both roofs 
the face of co in Q. Now the roof property implies: 

Lemma A-5 

Let /'^"P(cu) = /'""^{(jj). Then 

^inf _ ^sup ^ f^^g of o; in Q (46) 
and is equal to on the face of u. 
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. That implies f2™ is 
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